Extract text from memes with Python, OpenCV and Tesseract OCR

#artificialintelligence

Extracting text from an image can serve many purposes. In our case, we needed the text to improve the performance of our multi-modal sentiment classification model, which works on tweets accompanied by images. Since we found that the most common reaction pictures on social media are formatted as memes, we developed a pipeline to extract text from images in that format, and in this article we present it. Currently (Nov 2020), the state of the art in text extraction through OCR is represented by Google's Tesseract OCR, the most widely used open-source software for this task. Tesseract is easy to install (following this link) and to use in a Python environment through the pytesseract library.
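To give a rough idea of what calling Tesseract from Python looks like, here is a minimal sketch. It is not the pipeline described in this article: it only loads an image with OpenCV, converts it to grayscale, and passes it to pytesseract. The function names `extract_text` and `clean_ocr_text` and the file name `meme.jpg` are hypothetical, and the sketch assumes that the `opencv-python` and `pytesseract` packages, plus the Tesseract binary itself, are installed.

```python
def clean_ocr_text(raw: str) -> str:
    """Collapse runs of whitespace (newlines, form feeds, spaces) into single spaces."""
    return " ".join(raw.split())


def extract_text(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on a grayscale copy of the image and return the cleaned text.

    Illustrative helper (hypothetical name), not the article's actual pipeline.
    """
    # Imported here so clean_ocr_text stays usable without the OCR stack installed.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)  # returns None if the file cannot be read
    if image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")

    # OpenCV loads images in BGR channel order; convert to a single gray channel.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

    # pytesseract accepts NumPy arrays as well as PIL images.
    raw = pytesseract.image_to_string(gray, lang=lang)
    return clean_ocr_text(raw)


# Example call (hypothetical file name):
# print(extract_text("meme.jpg"))
```

The whitespace cleanup is a convenience for short captions spread over several lines; it would need to be dropped if the original line breaks matter for later processing.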